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Abstract 

A method is presented for evaluating authors on the basis of citations. It assigns 
to each author a citation score which depends upon the number of times he is cited, 
and upon the scores of the citers. The scores are found to be the components of 
an eigenvector of a normahzed citation matrix. The same method can be apphed to 
citation of journals by other journals, to evaluating teams in a league [1], etc. 



1 Introduction 

One commonly used measure of the influence of an author is the number of times his work 
is referred to by others in a given period of time. For a scientiflc author, this number can be 
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found by counting the number of citations of his work hsted in the Science Citation Index 
for that period. However, this measure fails to take into account who is doing the citing. 
A citation by an influential author ought to carry more weight than one by an unknown 
author, but a simple count of citations does not give it more weight. A more appropriate 
measure would be a weighted count of the citations, in which each citation is weighted by 
some measure of the influence of the citer. We shall show how to find such a measure, which 
we call a citation score, or just score for short. 

We begin by assuming that each author can be assigned a score, and that the weight of 
a citation is proportional to the score of the citer. Then we determine the score of an author 
by adding up the weights of all the citations of his work. The circularity of this method leads 
to a requirement of consistency among the scores, which determines them as the solutions 
of a system of linear equations. This method also applies to the evaluation of journals on 
the basis of citations of them in other journals. 

We have used this methods before [1] to evaluate teams in a league, with Cij the number of 
times that team i beats team j. Other ranking methods are reviewed by Moon and Pullman 



2 The citation matrix 

Let us consider authors, numbered from 1 to N. We denote by cf- the number of times 
that j cited i, omitting self citations, so that c-j = 0. The c[j form an A by A^ square matrix 
C, which we call the citation matrix. 
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The j-th column of C records all the citations by j of others, and the sum of the entries 
in that column is the total number of citations by j. If the sum is not zero, we normalize 
the column by dividing each entry by the column sum. Thus we define Cij by 

If all the entries in column j are zero, we define = 0. We call the matrix with entries 
the normalized citation matrix C. From the definition ([1]) we see that Cjj is the fraction of 
j's citations which refer to i. Furthermore, each column sum is unity, unless all the entries 
in the column are zero, in which case it is zero. 

Next we denote by Xi the score of author %. According to the method mentioned in the 
Introduction, Xi is given by 

Xi A ^ ^ ^ij'^j' (^) 

i=i 

Here is a factor of proportionality. Thus Xi is the sum of contributions from all citers 
j i. Each contribution is the product of the score of j times the fraction of j's citations 
which refer to i, all times A~^. 

Equation (j2j) is a system of linear homogenous equations for the scores Xt. In terms 
of the score vector x = {xi, . . . ,xn), ^ can be written 

Cx = \x. (3) 

Thus the score vector x is an eigenvector of the normalized citation matrix C corresponding 
to the eigenvalue A. 
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3 Eigenvectors of the citation matrix 

In order for x to be a score vector, its components must be non-negative numbers. Since all 
the entries Cij of C are non-negative, Theorem 3 on p. 66 of Gantmacher [3] shows that C 
has a real non-negative eigenvalue A with a non-negative eigenvector. 

This non-negative eigenvector x could be used to determine the scores if it were unique, 
aside from a constant factor. It will certainly be unique if C is irreducible, i.e., if it cannot 
be put in the following form by a permutation of the indices: 



Here A and D are square matrices. 

When C is irreducible, Frobenius' theorem states that it has a real non-negative eigen- 
value A which is larger than the modulus of any other eigenvalue. Furthermore the eigenvec- 
tor X corresponding to A is unique up to a scalar factor, and all its components are positive. 
(Gantmacher, p. 53, theorem 2.) In this case the components of x, with some suitable 
normalization, can be used as the scores. 

The eigenvalue A can be determined by first summing ([2]) over i. Then from the fact that 
C is normalized, it follows that the sum of Cij over i is unity. The other possibility, that all 
the Cij in one column vanish, cannot occur when C is irreducible. Thus we obtain from ([2]) 



A 



C 



(4) 



V 



B D 





Now ([5]) shows that A = 1. 
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Since A = 1, we can write ([3]) in the form 



Cx = X. 



(6) 



We also impose the normahzation condition that the largest component of x is unity; 



The previous considerations show that when C is irreducible, (P) and ([7]) have a unique 
solution in which all the components Xi are positive. They are the scores we wanted. 

4 The reducible case 

Let us now suppose that the normalized citation matrix C is reducible, and that it has been 
put in the form (jl]). If = 0, C is block diagonal, which means that there are two distinct 
sets of authors with no references by members of either set to the works of those in the 
other set. This might be the case if the two sets of authors write about completely separate 
fields. When C is block diagonal, it has at least two linearly independent eigenvectors, one 
corresponding to A and another to D. It is not surprising that in this case the scores of the 
two sets of authors are unrelated. 

Next we suppose that B 0, and that both A and D are irreducible. Then still 
holds, so A = 1. Now we write x in the partitioned form x = {y,z), and ([3]) becomes the 
pair of equations 



max Xi = 1. 



(7) 



Ay = y, 
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By + Dz = z. 



(9) 
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The matrix D-I in (Q is singular because A = 1 is an eigenvalue of D. Furthermore, co, 
the corresponding left eigenvector of D, has all positive components. The inner product of 
uj with ([9]) yields the solvability condition By = 0. Since all the entries in B are non- 
negative, while u and y are positive, uP^By > unless y = 0. Thus y = and then Q has 
a positive solution. Therefore when C is reducible to the form (j4]) with A and D irreducible 
and B ^ 0, there is a unique normalized score vector x = {y, z) in which y = 0. 

The conclusion that y = for all the authors corresponding to A is unsatisfactory, and 
suggests that the reducible case should be treated differently. 
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